Can GC content at third-codon positions be used as a proxy for isochore composition?
نویسندگان
چکیده
The isochore theory depicts the genomes of warm-blooded vertebrates as a mosaic of long genomic regions that are characterized by relatively homogeneous GC content. In the absence of genomic data, the GC content at third-codon positions of protein-coding genes (GC3) was commonly used as a proxy for the GC content of isochores. Oddly, in the postgenomic era, GC3 is still sometimes used as a proxy for the GC composition of isochores. Here, we use genic and genomic sequences from human, chimpanzee, cow, mouse, rat, chicken, and zebrafish to show that GC3 only explains a very small proportion of the variation in GC content of long genomic sequences flanking the genes (GCf), and what little correlation there is between GC3 and GCf was found to decay rapidly with distance from the gene. The coefficient of variation of GC3 was found to be much larger than that of GCf and, therefore, GC3 and GCf values are not comparable with each other. Comparisons of orthologous gene pairs from 1) human and chimpanzee and 2) mouse and rat show strong correlations between their GC3 values, but very weak correlations between their GCf values. We conclude that the GC content of third-codon position cannot be used as stand-in for isochoric composition.
منابع مشابه
Warm-blooded isochore structure in Nile crocodile and turtle.
The genomes of warm-blooded vertebrates are characterized by a strong heterogeneity in base composition, with GC-rich and GC-poor isochores. The GC content of sequences, especially in third codon positions, is highly correlated with that of the isochore they are embedded in. In amphibian and fish genomes, GC-rich isochores are nearly absent. Thus, it has been proposed that the GC increase in a ...
متن کاملIsochore evolution in mammals: a human-like ancestral structure.
Codon usage in mammals is mainly determined by the spatial arrangement of genomic G + C-content, i.e., the isochore structure. Ancestral G + C-content at third codon positions of 27 nuclear protein-coding genes of eutherian mammals was estimated by maximum-likelihood analysis on the basis of a nonhomogeneous DNA substitution model, accounting for variable base compositions among present-day seq...
متن کاملEvolution of isochores in rodents.
The most deviant isochore pattern within mammals was found in rat and mouse; most other mammals possess a different kind of isochore organization called the "general pattern." However, isochore patterns remain largely unknown in rodents other than mouse and rat. To investigate the taxonomic distribution of isochore patterns in rodents, we sequenced the nuclear gene LCAT (lecithin:cholesterol ac...
متن کاملStudy of Completed Archaeal Genomes and Proteomes: Hypothesis of Strong Mutational AT Pressure Existed in Their Common Predecessor
The number of completely sequenced archaeal genomes has been sufficient for a large-scale bioinformatic study. We have conducted analyses for each coding region from 36 archaeal genomes using the original CGS algorithm by calculating the total GC content (G+C), GC content in first, second and third codon positions as well as in fourfold and twofold degenerated sites from third codon positions, ...
متن کاملEvolution of base composition in the insulin and insulin-like growth factor genes.
The genomes of homeothermic (warm-blooded) vertebrates are mosaic interspersions of homogeneously GC-rich and GC-poor regions (isochores). Evolution of genome compartmentalization and GC-rich isochores is hypothesized to reflect either selective advantages of an elevated GC content or chromosome location and mutational pressure associated with the timing of DNA replication in germ cells. To add...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Molecular biology and evolution
دوره 26 8 شماره
صفحات -
تاریخ انتشار 2009